Search CORE

16 research outputs found

A method for comparing multiple imputation techniques: A case study on the U.S. national COVID cohort collaborative.

Author: Blau Hannah
Bramante Carolyn T
Buse John B
Callahan Tiffany J
Casiraghi Elena
Chan Lauren E
Coleman Ben D
Evans Michael D
Hall Margaret
Huling Jared D
Johnson Steven G
Laraway Bryan
Moffitt Richard A
Notaro Marco
Paccanaro Alberto
Raymond Shao Yu
Reese Justin
Robinson Peter N
Stürmer Til
Tronieri Jena S
Valentini Giorgio
Wilkins Kenneth J
Wong Rachel
Publication venue: The Mouseion at the JAXlibrary
Publication date: 01/03/2023
Field of study

Healthcare datasets obtained from Electronic Health Records have proven to be extremely useful for assessing associations between patients’ predictors and outcomes of interest. However, these datasets often suffer from missing values in a high proportion of cases, whose removal may introduce severe bias. Several multiple imputation algorithms have been proposed to attempt to recover the missing information under an assumed missingness mechanism. Each algorithm presents strengths and weaknesses, and there is currently no consensus on which multiple imputation algorithm works best in a given scenario. Furthermore, the selection of each algorithm’s pa- rameters and data-related modeling choices are also both crucial and challenging

The Jackson Laboratory: The Mouseion at the JAXlibrary

The Monarch Initiative in 2024: an analytic platform integrating phenotypes, genes and diseases across species.

Author: Alquaddoomi Faisal S
Braun Ian
Bruskiewich Richard M
Cappelletti Luca
Carbon Seth
Caron Anita R
Caufield J Harry
Chan Lauren E
Chute Christopher G
Cortes Katherina G
Cox Corey
De Souza Vinícius
Elsarboukh Glass
Fontana Tommaso
Gehrke Sarah
Haendel Melissa A
Harris Nomi L
Hartley Emily L
Hegde Harshad
Hurwitz Eric
Jacobsen Julius O B
Krishnamurthy Madan
Laraway Bryan J
Matentzoglu Nicolas
McLaughlin James A
McMurry Julie A
Moxon Sierra A T
Mullen Kathleen R
Mungall Christopher J
Munoz-Torres Monica C
O\u27Neil Shawn T
Osumi-Sutherland David
Putman Tim E
Reese Justin T
Robinson Peter N
Rubinetti Vincent P
Schaper Kevin
Shefchek Kent A
Smedley Damian
Stefancsik Ray
Toro Sabrina
Vasilevsky Nicole A
Walls Ramona L
Whetzel Patricia L
Publication venue: The Mouseion at the JAXlibrary
Publication date: 05/01/2024
Field of study

Bridging the gap between genetic variations, environmental determinants, and phenotypic outcomes is critical for supporting clinical diagnosis and understanding mechanisms of diseases. It requires integrating open data at a global scale. The Monarch Initiative advances these goals by developing open ontologies, semantic data models, and knowledge graphs for translational research. The Monarch App is an integrated platform combining data about genes, phenotypes, and diseases across species. Monarch\u27s APIs enable access to carefully curated datasets and advanced analysis tools that support the understanding and diagnosis of disease for diverse applications such as variant prioritization, deep phenotyping, and patient profile-matching. We have migrated our system into a scalable, cloud-based infrastructure; simplified Monarch\u27s data ingestion and knowledge graph integration systems; enhanced data mapping and integration standards; and developed a new user interface with novel search and graph navigation features. Furthermore, we advanced Monarch\u27s analytic tools by developing a customized plugin for OpenAI\u27s ChatGPT to increase the reliability of its responses about phenotypic data, allowing us to interrogate the knowledge in the Monarch graph using state-of-the-art Large Language Models. The resources of the Monarch Initiative can be found at monarchinitiative.org and its corresponding code repository at github.com/monarch-initiative/monarch-app

The Jackson Laboratory: The Mouseion at the JAXlibrary

The Human Phenotype Ontology in 2024: phenotypes around the world.

Author: Addo-Lartey Eunice B
Anagnostopoulos Anna V
Anderton Joel
Avillach Paul
Bagley Anita M
Bakštein Eduard
Balhoff James P
Baynam Gareth
Bello Susan M
Berk Michael
Bertram Holli
Bishop Somer
Blau Hannah
Bodenstein David F
Botas Pablo
Boztug Kaan
Callahan Tiffany J
Cameron Rhiannon
Carbon Seth J
Carmody Leigh
Castellanos Francisco
Caufield J Harry
Chan Lauren E
Chute Christopher G
Coleman Ben D
Cruz-Rojo Jaime
Dahan-Oliel Noémi
Danis Daniel
Davids Jon R
de Dieuleveult Maud
de Souza Vinicius
de Vries Bert B A
de Vries Esther
DePaulo J Raymond
Derfalvi Beata
Dhombres Ferdinand
Diaz-Byrd Claudia
Dingemans Alexander J M
Donadille Bruno
Duyzend Michael
Elfeky Reem
Essaid Shahim
Fabrizzi Carolina
Fico Giovanna
Firth Helen V
Freudenberg-Hua Yun
Fullerton Janice M
Gabriel Davera L
Gargano Michael
Gilmour Kimberly
Giordano Jessica
Goes Fernando S
Green Ian
Griese Matthias
Groza Tudor
Gu Weihong
Guthrie Julia
Gyori Benjamin
Haendel Melissa A
Hamosh Ada
Hanauer Marc
Hanušová Kateřina
Harris Nomi L
He Yongqun Oliver
Hegde Harshad
Helbig Ingo
Holasová Kateřina
Hoyt Charles Tapley
Huang Shangzhi
Hurwitz Eric
Jacobsen Julius O B
Jiang Xiaofeng
Joseph Lisa
Keramatian Kamyar
King Bryan
Knoflach Katrin
Koolen David A
Kraus Megan L
Kroll Carlo
Kusters Maaike
Köhler Sebastian
Ladewig Markus S
Lagorce David
Lai Meng-Chuan
Lapunzina Pablo
Laraway Bryan
Lewis-Smith David
Li Xiarong
Lucano Caterina
Majd Marzieh
Marazita Mary L
Martinez-Glez Victor
Matentzoglu Nicolas
McHenry Toby H
McInnis Melvin G
McMurry Julie A
Mihulová Michaela
Millett Caitlin E
Mitchell Philip B
Moses Rachel Gore
Moslerová Veronika
Mungall Christopher J
Munoz-Torres Monica C
Narutomi Kenji
Nematollahi Shahrzad
Nevado Julian
Nierenberg Andrew A
Nurnberger John I
Ogishima Soichi
Olson Daniel
Ortiz Abigail
Pachajoa Harry
Perez de Nanclares Guiomar
Peters Amy
Putman Tim
Rapp Christina K
Rath Ana
Reese Justin
Rekerle Lauren
Roberts Angharad M
Robinson Peter N
Roy Suzy
Sanders Stephan J
Schuetz Catharina
Schulte Eva C
Schulze Thomas G
Schwarz Martin
Scott Katie
Seelow Dominik
Seitz Berthold
Shen Yiping
Similuk Morgan N
Simon Eric S
Singh Balwinder
Smedley Damian
Smith Cynthia
Smolinsky Jake T
Sperry Sarah
Stafford Elizabeth
Stefancsik Ray
Steinhaus Robin
Strawbridge Rebecca
Sundaramurthi Jagadish Chandrabose
Talapova Polina
Tenorio Castano Jair A
Tesner Pavel
Thomas Rhys H
Thurm Audrey
Toro Sabrina
Turnovec Marek
van Gijn Marielle E
Vasilevsky Nicole A
Vlčková Markéta
Walden Anita
Wang Kai
Wapner Ron
Ware James S
Wiafe Addo A
Wiafe Samuel A
Wiggins Lisa D
Williams Andrew E
Wu Chen
Wyrwoll Margot J
Xiong Hui
Yalin Nefize
Yamamoto Yasunori
Yatham Lakshmi N
Yocum Anastasia K
Young Allan H
Yüksel Zafer
Zandi Peter P
Zankl Andreas
Zarante Ignacio
Zvolský Miroslav
Čady Jolana
Čajbiková Nikola Novák
Publication venue: The Mouseion at the JAXlibrary
Publication date: 05/01/2024
Field of study

The Human Phenotype Ontology (HPO) is a widely used resource that comprehensively organizes and defines the phenotypic features of human disease, enabling computational inference and supporting genomic and phenotypic analyses through semantic similarity and machine learning algorithms. The HPO has widespread applications in clinical diagnostics and translational research, including genomic diagnostics, gene-disease discovery, and cohort analytics. In recent years, groups around the world have developed translations of the HPO from English to other languages, and the HPO browser has been internationalized, allowing users to view HPO term labels and in many cases synonyms and definitions in ten languages in addition to English. Since our last report, a total of 2239 new HPO terms and 49235 new HPO annotations were developed, many in collaboration with external groups in the fields of psychiatry, arthrogryposis, immunology and cardiology. The Medical Action Ontology (MAxO) is a new effort to model treatments and other measures taken for clinical management. Finally, the HPO consortium is contributing to efforts to integrate the HPO and the GA4GH Phenopacket Schema into electronic health records (EHRs) with the goal of more standardized and computable integration of rare disease data in EHRs

The Jackson Laboratory: The Mouseion at the JAXlibrary

Risk of new-onset psychiatric sequelae of COVID-19 in the early and late post-acute phase.

Author: Blau Hannah
Callahan Tiffany J
Casiraghi Elena
Chan Lauren
Coleman Ben D
Deer Rachel R
Haendel Melissa A
Laraway Bryan
Reese Justin
Robinson Peter N
Wilkins Kenneth J
Publication venue: 'Wiley'
Publication date: 01/06/2022
Field of study

The Jackson Laboratory: The Mouseion at the JAXlibrary

PubMed Central

Predictive models of long COVIDResearch in context

Author: Andrew E. Williams
Blessy Antony
Bryan J. Laraway
Christopher Chute
Corneliu C. Antonescu
Elena Casiraghi
Giorgio Valentini
Hannah Blau
Johanna J. Loomba
Justin T. Reese
Kenneth J. Wilkins
Peter N. Robinson
T.M. Murali
Tiffany J. Callahan
Publication venue: Elsevier
Publication date: 04/09/2023
Field of study

Summary: Background: The cause and symptoms of long COVID are poorly understood. It is challenging to predict whether a given COVID-19 patient will develop long COVID in the future. Methods: We used electronic health record (EHR) data from the National COVID Cohort Collaborative to predict the incidence of long COVID. We trained two machine learning (ML) models — logistic regression (LR) and random forest (RF). Features used to train predictors included symptoms and drugs ordered during acute infection, measures of COVID-19 treatment, pre-COVID comorbidities, and demographic information. We assigned the ‘long COVID’ label to patients diagnosed with the U09.9 ICD10-CM code. The cohorts included patients with (a) EHRs reported from data partners using U09.9 ICD10-CM code and (b) at least one EHR in each feature category. We analysed three cohorts: all patients (n = 2,190,579; diagnosed with long COVID = 17,036), inpatients (149,319; 3,295), and outpatients (2,041,260; 13,741). Findings: LR and RF models yielded median AUROC of 0.76 and 0.75, respectively. Ablation study revealed that drugs had the highest influence on the prediction task. The SHAP method identified age, gender, cough, fatigue, albuterol, obesity, diabetes, and chronic lung disease as explanatory features. Models trained on data from one N3C partner and tested on data from the other partners had average AUROC of 0.75. Interpretation: ML-based classification using EHR information from the acute infection period is effective in predicting long COVID. SHAP methods identified important features for prediction. Cross-site analysis demonstrated the generalizability of the proposed methodology. Funding: NCATS U24 TR002306, NCATS UL1 TR003015, Axle Informatics Subcontract: NCATS-P00438-B, NIH/NIDDK/OD, PSR2015-1720GVALE_01, G43C22001320007, and Director, Office of Science, Office of Basic Energy Sciences of the U.S. Department of Energy Contract No. DE-AC02-05CH11231

The Jackson Laboratory: The Mouseion at the JAXlibrary

Directory of Open Access Journals

The IDeaS initiative: Pilot study to assess the impact of rare diseases on patients and healthcare systems

Author: Chan Chun-Hung
Cutillo Christine M
Dawkins Hugh
Griese Emily
Haendel Melissa
Hasche Cindy
Laraway Bryan
Nathan Ramaa
Nowak Douglas
Pariser Anne R
Pearce David A
Russo Pierantonio
Rutter Joni L
Shukla Oodaye
Tisdale Ainslie
Publication venue: ResearchOnline@ND
Publication date: 01/01/2021
Field of study

Background: Rare diseases (RD) are a diverse collection of more than 7–10,000 different disorders, most of which affect a small number of people per disease. Because of their rarity and fragmentation of patients across thousands of different disorders, the medical needs of RD patients are not well recognized or quantified in healthcare systems (HCS). Methodology: We performed a pilot IDeaS study, where we attempted to quantify the number of RD patients and the direct medical costs of 14 representative RD within 4 different HCS databases and performed a preliminary analysis of the diagnostic journey for selected RD patients. Results: The overall findings were notable for: (1) RD patients are difficult to quantify in HCS using ICD coding search criteria, which likely results in under-counting and under-estimation of their true impact to HCS; (2) per patient direct medical costs of RD are high, estimated to be around three–fivefold higher than age-matched controls; and (3) preliminary evidence shows that diagnostic journeys are likely prolonged in many patients, and may result in progressive, irreversible, and costly complications of their disease Conclusions: The results of this small pilot suggest that RD have high medical burdens to patients and HCS, and collectively represent a major impact to the public health. Machine-learning strategies applied to HCS databases and medical records using sentinel disease and patient characteristics may hold promise for faster and more accurate diagnosis for many RD patients and should be explored to help address the high unmet medical needs of RD patients

ResearchOnline@ND

Directory of Open Access Journals

Metformin is associated with reduced COVID-19 severity in patients with prediabetes.

Author: Antony Blessy
Blau Hannah
Bramante Carolyn
Casiraghi Elena
Chan Lauren E
Coleman Ben
Gargano Michael
Haendel Melissa
Harris Nomi L
Laraway Bryan
N3C consortium
Reese Justin
Robinson Peter N
Sahner David
Valentini Giorgio
Wilkins Kenneth
Zaman Adnin
Publication venue: eScholarship, University of California
Publication date: 30/08/2022
Field of study

AimsStudies suggest that metformin is associated with reduced COVID-19 severity in individuals with diabetes compared to other antihyperglycemics. We assessed if metformin is associated with reduced incidence of severe COVID-19 for patients with prediabetes or polycystic ovary syndrome (PCOS), common diseases that increase the risk of severe COVID-19.MethodsThis observational, retrospective study utilized EHR data from 52 hospitals for COVID-19 patients with PCOS or prediabetes treated with metformin or levothyroxine/ondansetron (controls). After balancing via inverse probability score weighting, associations with COVID-19 severity were assessed by logistic regression.ResultsIn the prediabetes cohort, when compared to levothyroxine, metformin was associated with a significantly lower incidence of COVID-19 with "mild-ED" or worse (OR [95% CI]: 0.636, [0.455-0.888]) and "moderate" or worse severity (0.493 [0.339-0.718]). Compared to ondansetron, metformin was associated with lower incidence of "mild-ED" or worse severity (0.039 [0.026-0.057]), "moderate" or worse (0.045 [0.03-0.069]), "severe" or worse (0.183 [0.077-0.431]), and "mortality/hospice" (0.223 [0.071-0.694]). For PCOS, metformin showed no significant differences in severity compared to levothyroxine, but was associated with a significantly lower incidence of "mild-ED" or worse (0.101 [0.061-0.166]), and "moderate" or worse (0.094 [0.049-0.18]) COVID-19 outcome compared to ondansetron.ConclusionsMetformin use is associated with less severe COVID-19 in patients with prediabetes or PCOS

The Jackson Laboratory: The Mouseion at the JAXlibrary

PubMed Central

eScholarship - University of California

Recommended from our members

The Monarch Initiative: An integrative data and analytic platform connecting phenotypes to genotypes across species

Author: Balhoff James
Borromeo Charles
Brush Matthew
Carbon Seth
Conlin Tom
Dunn Nathan
Engelstad Mark
Foster Erin
Gourdine JP
Groza Tudor
Haendel Melissa
Hochheiser Harry
Jacobsen Julius OB
Keith Daniel
Köhler Sebastian
Laraway Bryan
Lewis Suzanna
McMurry Julie
Mungall Christopher
Nguyen Xuan Jeremy
Robinson Peter
Shefchek Kent
Smedley Damian
Vasilevsky Nicole
Washington Nicole
Yuan Zhou
Publication venue: eScholarship, University of California
Publication date: 01/01/2016
Field of study

The principles of genetics apply across the whole tree of life: on a cellular level, we share mechanisms with species from which we diverged millions or even billions of years ago. We can exploit this common ancestry at the level of sequences, but also in terms of observable outcomes (phenotypes), to learn more about health and disease for humans and all other species. Applying the range of available knowledge to solve challenging disease problems requires unified data relating genomics, phenotypes, and disease; it also requires computational tools that leverage these multimodal data to inform interpretations by geneticists and to suggest experiments. However, the distribution and heterogeneity of databases is a major impediment: databases tend to focus either on a single data type across species, or on single species across data types. Although each database provides rich, high-quality information, no single one provides unified data that is comprehensive across species, biological scales, and data types. Without a big-picture view of the data, many questions in genetics are difficult or impossible to answer. The Monarch Initiative ( https://monarchinitiative.org ) is an international consortium dedicated to providing computational tools that leverage a computational representation of phenotypic data for genotype-phenotype analysis, genomic diagnostics, and precision medicine on the basis of a large-scale platform of multimodal data that is deeply integrated across species and covering broad areas of disease

eScholarship - University of California

Recommended from our members

Navigating the Phenotype Frontier: The Monarch Initiative.

Author: Balhoff James P
Borromeo Charles
Brush Matthew
Carbon Seth
Conlin Tom
Dunn Nathan
Engelstad Mark
Foster Erin
Gourdine Jean-Philippe
Groza Tudor
Haendel Melissa A
Hochheiser Harry
Jacobsen Julius OB
Keith Daniel
Köhler Sebastian
Laraway Bryan
Lewis Suzanna E
McMurry Julie A
Mungall Christopher J
Robinson Peter N
Shefchek Kent
Smedley Damian
Vasilevsky Nicole A
Washington Nicole L
Xuan Jeremy Nguyen
Yuan Zhou
Publication venue: eScholarship, University of California
Publication date: 01/08/2016
Field of study

The principles of genetics apply across the entire tree of life. At the cellular level we share biological mechanisms with species from which we diverged millions, even billions of years ago. We can exploit this common ancestry to learn about health and disease, by analyzing DNA and protein sequences, but also through the observable outcomes of genetic differences, i.e. phenotypes. To solve challenging disease problems we need to unify the heterogeneous data that relates genomics to disease traits. Without a big-picture view of phenotypic data, many questions in genetics are difficult or impossible to answer. The Monarch Initiative (https://monarchinitiative.org) provides tools for genotype-phenotype analysis, genomic diagnostics, and precision medicine across broad areas of disease

eScholarship - University of California

Recommended from our members

The Monarch Initiative: an integrative data and analytic platform connecting phenotypes to genotypes across species.

Author: Balhoff James P
Borromeo Charles
Brush Matthew
Carbon Seth
Conlin Tom
Dunn Nathan
Engelstad Mark
Foster Erin
Gourdine JP
Groza Tudor
Haendel Melissa A
Hochheiser Harry
Jacobsen Julius OB
Keith Dan
Köhler Sebastian
Laraway Bryan
Lewis Suzanna E
McMurry Julie A
Mungall Christopher J
NguyenXuan Jeremy
Robinson Peter N
Shefchek Kent
Smedley Damian
Vasilevsky Nicole
Washington Nicole
Yuan Zhou
Publication venue: eScholarship, University of California
Publication date: 01/01/2017
Field of study

The correlation of phenotypic outcomes with genetic variation and environmental factors is a core pursuit in biology and biomedicine. Numerous challenges impede our progress: patient phenotypes may not match known diseases, candidate variants may be in genes that have not been characterized, model organisms may not recapitulate human or veterinary diseases, filling evolutionary gaps is difficult, and many resources must be queried to find potentially significant genotype-phenotype associations. Non-human organisms have proven instrumental in revealing biological mechanisms. Advanced informatics tools can identify phenotypically relevant disease models in research and diagnostic contexts. Large-scale integration of model organism and clinical research data can provide a breadth of knowledge not available from individual sources and can provide contextualization of data back to these sources. The Monarch Initiative (monarchinitiative.org) is a collaborative, open science effort that aims to semantically integrate genotype-phenotype data from many species and sources in order to support precision medicine, disease modeling, and mechanistic exploration. Our integrated knowledge graph, analytic tools, and web services enable diverse users to explore relationships between phenotypes and genotypes across species

eScholarship - University of California